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(57) Abstract: A system and method for identifying me- 
dia content presented over a media playing device. The 
media content, such as audio and/or video, is either avail- 
able digitally or digitally sampled. The media content is 
sampled to generate a media sample or analytical repre- 
sentation of the media content. The media sample is com- 
pared to a collection of sampled (or represented) media 
content to identify it and to ascertain information related 
to the sample. This media content-related information is 
then presented to the user via a display means on the me- 
dia player. The media player then presents the user spe- 
cific and related actions that are based upon the informa- 
tion presented and allows the user to directly execute their 
choice of actions. 
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P R r T F T C A T I O N 

A METHOD AND APPARATUS FOR IDENTIFYING MEDIA CONTENT 
PRESENTED ON A MEDIA PLAYING DEVICE 

10 

1 . Field of the Invention 

This invention pertains generally to media content identification methods. More 
particularly, the invention is an apparatus and method for identifying media content 
presented on a media playing device and presenting media-content related information 
15 and/or action to the user of the media playing device. 

2. The Prior Art 

The use of a data processing means, such as a computer or personal digital 
assistant (PDA), for playing media content is well known. For example, many users use 
their personal computer (PC) for playing audio compact disks (CD), normally with the 

20 use of a CD-ROM drive, a sound card, and speakers. Other types of media, including 
various formats of audio and video, are also commonly played using a PC. 

With the increasing popularity of the global information network commonly 
known as the "Internet", various audio and video formats have been introduced to 
provide live (as well as archived) media broadcasts over the Intemet. These broadcasts 

25 may be viewed by users connected to the Intemet with the use of a PC and the proper 
client application. For example, Real Networks™ provides Real Player^i^' (a client 
application) for playing streaming audio and video content (in Real™ format) which is 
broadcast over the Intemet. Various servers (also connected to the Intemet) carry out the 
operation of making such content available and streaming the appropriate content to 

30 users upon request. In this way. Real™ media content may be played by a user using 
Real Player™ on the user's PC. Like other media client applications, Real Player 
plays audio content via the user's sound card and speakers and video content via the 
user's video card and monitor (or other viewing device). 

Intemet radio broadcasts (or webcasts) are also known in the art. In general, 

35 Intemet radio broadcasts are provided over the Intemet by one or more server computers 
equipped to provide streaming audio content, much like traditional AM or FM radio 
broadcast. A user that would like to listen to an Intemet radio broadcast would use a 
chent application (such as Real Player^M, Microsoff^^^ media player, or Apple 
QuickTime^M viewer, for example) and direct the client application to the appropriate 

40 server computer. The server computer then transmits the media content to the client 

application via the Internet. The client application receives the media content transmitted 
from the server computer and plays the content using the user's sound card and 
speaker. 

1 
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5 Internet video broadcasts (another form of webcast) are carried out in a similar 

fashion to that of Internet radio broadcasts, except video broadcasts typically contain 
both audio and video content. As such, the audio component is played to the user via a 
sound card and speakers, while the video component is displayed using a video card and 
a monitor. 

10 In many cases, media content webcasts will not provide content identifying 

information, such as song title, aitist name or album name, as part of the media 
broadcast. Other information, such as where the content can be purchased in the form of 
a CD, DVD (digital video disk) or VHS (video home system) tape, for example, are also 
typically not included. The primary reason a webcast provider (station) fails to provide 

15 such content-related infonnation is that the station does not have real-time access to 

accurate playlists. For example, it is common for a station disk jockey to make real-time 
adjustments to the playlist order and timing. Thus current radio station systems do not 
transmit with or attach content data with the broadcast. 

In addition to webcasts, many Internet sites provide archived or pre-recorded 

20 media content for download and playback on a user' s PC. Examples of such content 
include "wav", "mp3", "mov", and "midi" files, among others. Once these files have 
been downloaded from the Internet site to the user's PC, the user is able to play back the 
audio or video using an appropriate client application. Many of these media files, like 
webcasts, also fail to provide content identifying and other information, 

25 Additionally, PCs often have the ability to play other forms of media content via 

PC sound card and speakers and/or PC video card and monitor. For example, tuner 
expansion cards (tuners) which may be inserted into (or interfaced with) a user's PC are 
presently available (some computers already include such tuners). In either case, 
"television" (TV) tuners allow a user of the PC to view conventional television, cable or 

30 sateUite transmissions on the PC monitor and hear the audio via the PC speaker. 

"Radio" tuners allow a user to hear conventional (FM or AM) radio broadcast using 
the PC speaker. Conventional TV and radio broadcast also fail to provide rich content- 
related information to the user. 

Other media playing devices such as televisions, radio/stereo systems, and 

35 portable audio devices also typically do not provide content-related information on 
demand for the user. For example, in automobiles, current car stereo systems fail to 
provide a system and method for identifying the music being played on the stereo. 
Typically a user would either rely on the disk jockey to identify the artist and song title 
or call the radio station to ascertain the information. 

40 Accordingly, there is a need for an apparatus and method which provides real- 

time media content-related information and context specific actions choices to users 
viewing and/or listening to media content over media playing devices. The present 
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.5 invention satisfies these needs, as well as others, and generally overcomes the 
deficiencies found in the background art. 

BRIEF DESCRIPTION OF THE INVENTION 
The present invention is a system and method for identifying media content 
presented over a media playing device, such as a computer. The system generates a 

10 media sample or analytical representation of the media content, such as audio and/or 

video, played on the media player. The media sample or representation is compared to a 
database of sampled media content or representations to query and ascertain information 
related to the sample. This media content-related infonnadon is then displayed on the 
media player. 

15 The invention further relates to machine readable media on which are stored 

embodiments of the present invention. It is contemplated that any media suitable for 
retrieving instructions is within the scope of the present invention. By way of example, 
such media may take the form of magnetic, optical, or semiconductor media. The 
invention also relates to data structures that contain embodiments of the present 

20 invention, and to the transmission of data stmctures containing embodiments of the 
present invention. 

By way of example only, and not of limitation, the media player may comprise 
any data processing means or computer executing the present invention including 
devices carrying out the invention via an embedded system. For example, the media 

25 player may comprise cellular or mobile phones, portable media players, and fixed media 
players including analog broadcast receivers (such as car stereo, home stereos, 
televisions, for example). 

In a first embodiment, the database of sampled media content resides on a 
^'lookup server" which carries out the database query described above. In general, the 

30 lookup server is operatively coupled for communication to one or more media players 
(client) via the Internet as is known in the art. Under this arrangement, the media sample 
is first communicated from the chent to the lookup server before the media sample is 
compared to the database. Then after the database query is carried out on the lookup 
ser\'er, the content-related information is transmitted to the client for presentation 

35 thereon. 

According to this embodiment, the system of the present invention generally 
includes one or more client media players, each operatively coupled for communication 
to a lookup server. As noted above, the lookup server is generally connected to the client 
media players via an Internet connection, although any suitable network (including wired 
40 or wireless) connection may also be used. 

For example, the client media player may comprise a computer. The client 
computer typically plays audio files via a sound card and speakers. Video files are 
played via a video card and a monitor. The client computer has executing thereon a 

-> 
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5 "client engine" software application comprising a sampling unit, a media intercept unit 
(intercept unit), and a user-interface. 

By way of example, and not of limitation, the intercept unit carries out the 
operation of monitoring the client computer for media content presented thereon. In 
general the intercept unit cames out the operation of monitoring audio signals 

10 transmitted to the sound card and/or video signals transmitted to the video card. The 
intercept unit may also monitor the operations of other hardwaie suitable for 
communicating audio and/or video signals, such as USB (universal serial bus ) and IEEE 
(Institute of Electrical and Electronics Engineers) standard 1394 connections, for 
example. Preferably, the intercept unit maintains a FIFO (first in first out) buffer of 

15 fixed size (150 seconds, for example) containing media content played on the client. 

The sampling unit carries out the operation of creating a media sample of the 
media content played on the client computer in order to represent and identify the media 
content. In general, the sampling unit creates a sample from the FIFO buffer maintained 
by the intercept unit. The sampling unit may create media samples according to a 

20 predetermined or user-defined interval or upon the request of the user of the client 

computer. User request may be received via commands issued using the user-interface. 
In general, the media sample created by the sampling unit comprises a "digital 
fingerprint" or "digital signature" using conventional digital signal processing known 
in the art. For example, a sound file may be sampled according to its acoustic/perceptual 

25 features over time. It is noted that "media sample" is defined herein to include actual 
recordings of the media content and/or analytical representations of the media content. 
The media sample is then transmitted to the lookup server for further processing, as 
described in further detail below. 

The user interface carries out the operation of receiving commands from a user 

30 of the client computer, and displaying content-related information to the user. As noted 
above, a user may issue a request for content-related information via the user-mterface. 
This request is communicated to the sampling unit for further processing. As described 
above, in response to this request, the sampling unit creates a sample of the media 
content being played on the client computer and transmits the sample to the lookup 

35 server for further processing. In response, the lookup server provides the information 
related to the media sample to the client computer. This content-related information is 
received by the user interface which then displays the received information to the user of 
the client computer. 

The content-related infonnation returned from the lookup server may also be 

40 used for a plurality of purposes including, for example, generating a log of the user 
activity, providing an option to purchase media to the user, and displaying the content- 
related information on the media playing device, among others. 
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5 The lookup server has executing thereon a "server engine" software application 

comprising a lookup unit, and a log unit. The lookup unit is further coupled to a media 
database, and the log unit is coupled to a log database. While the present invention 
describes the lookup server as a single machine, a group or cluster of server machines 
may be used to caiTV out the tasks of the lookup server described herein to thereby 

10 balance the load between several machines as is known in the art. 

The lookup unit carries out the operation of receiving media samples from client 
computers and performing database queries on the media database to ascertain content 
information related to the media sample provided. This content-related information may 
include such information as song title, artist, album name, for example. The content- 

15 related information may also include product fulfillment information, such as how and 
where to purchase media containing the media sample, advertising banners, and/or 
promotional offers, for example. This content-related information is transmitted back to 
the client computer for further processing thereon. 

The log unit caries out the operation of tracking media requests made by the 

20 users of the client computers to the lookup server. The log unit maintains a plurality of 
information related to the requests made by the user including, for example, media type, 
genre or category. This user information is maintained in the log database. 

The source of the media content played on the client computer includes 
conventional media sources such as Internet sources or Webcasts, including streaming 

25 media and archived media. The media content source may also be audio CDs, DVD, or 
other formats suitable for presentation on media playing devices, such as the client 
computer. 

In a second embodiment of the present invention, the database of sampled media 
content resides within the client computer. The database resides on conventional storage 

30 medium, such a computer memory, a hard drive, CD-ROM, or other appropriate storage 
medium. Under this arrangement, the database query is earned out "locally" on the 
computer playing the media content. It will be readily apparent to those skilled in the art 
that vanous other topological arrangements of the system elements (including the 
location of the database) may be used with the invention without departing from the 

35 scope and spirit of the invention. 

An object of the invention is to provide an apparatus and method for identifying 
media content presented over a media playing device which overcomes the deficiencies 
of the prior art. 

Another object of the invention is to provide an apparatus and method for 
40 identifying media content presented over a media playing device which does not require 
the media content provider to provide content-related information. 

Further objects and advantages of the invention will be brought out in the 
following portions of the specification, wherein the detailed description is for the 

5 



wo 01/62004 



PCT/USOl/04997 



5 purpose of fully disclosing the preferred embodiment of the invention without placing 
limitations thereon. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The present invention will be more fully understood by reference to the 
following drawings, which are for illustrative purposes only. 
10 FIG. 1 is a functional block diagram of a media content identifying system in 

accordance with the invention. 

FIG. 2 IS a functional block diagram of a media content identifying system 
having a plurality of client nodes in accordance with the invention. 

FIG. 3 is a flow chart showing generally the processes associated with the client 
1 5 engine in accordance with the invention. 

FIG. 4 is flow chart showing generally the processes associated with the media 
intercept unit in accordance with the invention. 

FIG. 5 is flow chart showing generally the processes associated with the server 
engine in accordance with the invention. 

20 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Persons of ordinary skill in the art will realize that the following description of 
the present invention is illustrative only and not in any way limiting. Other 

25 embodiments of the invention will readily suggest themselves to such skilled persons 
having the benefit of this disclosure. 

Referring more specifically to the drawings, for illustrative purposes the present 
invention is embodied in the apparatus shown FIG. 1 and FIG. 2 and the method 
outhned in HG. 3 through HG. 5. It will be appreciated that the apparatus may vary as 

30 to configuration and as to details of the parts, and that the method may vary as to details 
and the order of the acts, without departing from the basic concepts as disclosed herein. 
The invention is disclosed generally in terms of an apparatus and method for media 
content identification on a personal computer, although numerous other uses for the 
invention will suggest themselves to persons of ordinary skill in the art, 

35 Referring first to FIG. I, there is generally shown a functional block diagram of 

a media content identifying system 10 in accordance with the invention. The system 10 
comprises a lookup server 12 and at least one client media player 14. The lookup server 
12 can be any standard data processing means or computer, including a minicomputer, a 
microcomputer, a UNIX® machine, a mainframe machine, a personal computer (PC) 

40 such as INTEL® based processing computer or clone thereof, an APPLE® computer or 
clone thereof or, a SUN® workstation, or other appropriate computer. Lookup server 12 
generally includes conventional computer components (not shown), such as a 
motherboard, central processing unit (CPU), random access memory (RAM), hard disk 
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5 drive, display adapter, other storage media such as diskette drive, CD-ROM, flash-ROM, 
tape drive, PCMCIA cards and/or other removable media, a monitor, keyboard, mouse 
and/or other user interface means, a modem, and/or other conventional input/output 
devices. Lookup server 12 also includes a Network Interface 17 for communication 
with other computers using an appropriate network protocol. 

] 0 Lookup server 12 has loaded in its RAM a conventional server operating system 

(not shown) such as UNIX®, WINDOWS® NT, NOVELL®, SOLARIS®, or other server 
operating system. Lookup server 12 also has loaded in its RAM server engine software 
16, which is discussed in more detail below. 

Client media player 14, may compnse a standard computer such as a 

15 minicomputer, a microcomputer, a UNIX® machine, mainframe machine, personal 
computer (PC) such as INTEL®, APPLE®, or SUN® based processing computer or 
clone thereof, or other appropnate computer. As such, client media player 14 is normally 
embodied in a conventional desktop or "tower" machine, but can alternatively be 
embodied in a portable or "laptop" computer, a handheld personal digital assistant 

20 (PDA), a cellular phone capable of playing media content, a dumb terminal playing 

media content, a specialized device such as Tivo® player, or an internet terminal playing 
media content such as WEBTV®, among others. As described above, client media player 
14 may also comprise other media playing devices such as portable stereo system, fixed 
stereo systems, and televisions suitable for use with embedded systems carrying out the 

25 operations of the client engine as described further below. 

Client media player 14 also includes typical computer components (not shown), 
such as a motherboard, central processing unit (CPU), random access memory (RAM), 
hard disk drive, other storage media such as diskette drive, CD-ROM, flash-ROM, tape 
drive, PCMCIA cards and/or other removable media, keyboard, mouse and/or other user 

30 interface means, a modem, and/or other conventional input/output devices. 

CHent media player 14 also has loaded in it RAM an operating system (not 
shown) such as UNIX®, WINDOWS® 98 or the Uke. Client media player 14 further 
has loaded m RAM a media client application program 18 such as Real Player™, 
W^indows'^^* Media Player, Apple Quicktime™ Player, or other appropriate media client 

35 application for playing audio and/or video via client media player 14. Chent media player 
14 also has loaded in its RAM client engine software 30, which is discussed in more 
detail below. 

Client media player 14 further includes a conventional sound card 20 connected 
to speakers 22. As is known in the art, the media client apphcation 18 will generally 
40 play audio signals through the sound card device 20 and speakers 22. For example, 
audio streams, audio files, audio CDs, and other audio sources are communicated from 
the client application 18 to the sound card 20 for output. The sound card 18 then 
produces the audio signal via speakers 22. 

7 
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Client media player 14 also includes a conventional video card 24 connected to a 
display means 26. As is knov^-n in the art, the media client 18 will generally play video 
content through the video card 24, which then produces an appropriate video signal 
suitable for display on the display means 26. The display means 26 may be a standard 

10 monitor, LCD, or other appropriate display apparatus. Client media player 14 also 
includes a Network Interface 28 for communication with other computers using an 
appropriate network protocol. The network interface 28 may comprise a network 
interface card (NIC), a modem, ISDN Ontegrated Services Digital Network) adapter, 
ADSL (Asynchronous digital subscriber line) adapter or other suitable network 

1 5 interface. 

Client media player 14 is networked for communication with lookup server 12. 
Typically, client media player 14 is operatively coupled to communicate with lookup 
server 12 via an Internet connection 32 through a phone connection using a modem and 
telephone line (not shown), in a standard fashion. The user of client media player 14 

20 will typically dial the user's Internet service provider (ISP) (not shown) through the 
modem and phone line to establish a connection between the client media player 14 and 
the ISP, thereby establishing a connection between the client media player 14 and the 
Internet 32. Generally, lookup server 12 is also connected to the Internet 32 via a 
second ISP (not shown), typically by using a fast data connection such as Tl, T3, 

25 multiple T 1 , multiple T3, or other conventional data connection means (not shown). 
Since computers connected to the Internet are themselves connected to each other, the 
Internet 32 establishes a network communication between client media player 14 and 
server 12. Generally, client media player 14 and lookup server 12 communicate using 
the IP (internet protocol). However, other protocols for communication may also be 

30 utiUzed, including PPTP, NetBEUI over TCP/IP, and other appropriate network 
protocols. 

Client media player 14 and server 12 can alternatively connect to the Internet 32 
using a network means, wireless means, satellite means, cable means, infrared means or 
other means for establishing a connection to the Internet, as alternatives to telephone line 

35 connection. Alternative means for networking chent media player 14 and server 12 may 
also be utilized, such as a direct point to point connection using modems, a local area 
network (LAN), a wide area network (WAN), wireless connection, satellite connection, 
direct port to port connection utilizing infrared, serial, parallel, USB, FireWire/IEEE- 
1394, ISDN, DSL and other means known in the art. 

40 In general, client engine 30 is a software application executing within the client 

media player 14. However, client engine 30 may also be an embedded system carrying 
out the functions described herein and suitable for use with various media player 
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5 platforms. Client engine 30 comprises a sampling unit 34, an intercept unit 36, and a 
user interface 38. 

The intercept unit 36 carries out the operation of monitoring the client media 
player 14 for media content presented thereon, typically by media client application 18, 
but may be from any media source. In operation, the intercept unit 36 monitors audio 

10 signals transmitted from the media client 1 8 to the sound card 20 and/or video signals 
transmitted from the media client 18 to the video cai'd 24. The intercept unit 36 may also 
monitor the operations of other hardware (not shown) suitable for communicating audio 
and/or video signals, such as USB, IEEE standard 1394 connections, and analog inputs 
devices (such as microphones). For example, m a media player comprising a mobile 

15 stereo system, the intercept unit 36 intercepts the audio signal played on thereon. 

Preferably, the intercept unit 36 maintains a FIFO (first in first out) buffer 40 of 
fixed size (150 seconds, for example) for storing media content played on the client 
media player 14, and from which media samples are created by sampling unit 34. The 
operadon of the intercept unit is further described below in conjunction with FIG. 4. 

20 The sampling unit 34 carries out the operation of creating a media sample of the 

media content played on the client media player 14. In operation, the sampling unit 34 
creates a media sample from the FIFO buffer 40 maintained by the intercept unit 36. 
The sampling unit 34 may create media samples according to a predetermined or user- 
defined interval or upon the request of user of the client media player 14. For example, 

25 user requests may be received via commands issued using the user-interface 38. 

As noted above, the media sample created by the sampling unit 34 typically 
comprises a "digital fingerprint" using conventional digital signal processing known in 
the art. For example, a sound file may be sampled according to its acoustic/perceptual 
features over time. The media sample generated by sampling unit 34 is then transmitted 

30 to the lookup server 12 for further processing, as described in further detail below. 

The user interface 38 carries out the operation of receiving commands from a 
user of the client media player 14, and displaying content-related information to the user. 
The user interface 38 may be any conventional interfaces including, for example, a 
graphical user interface (GUI), a command line user interface (CLUI), a voice 

35 recognition interface, a touch screen interface, a momentary contact switch on a stereo 
system, or other appropnate user interface. 

In operadon, a user may issue a request for content-related information via the 
user-interface 38. This request is communicated to the sampling unit 34 for further 
processing. As described above, in response to such a request, the sampling unit 34 

40 creates a media sample of the media content being played on the client media player 14 
and transmits the sample to the lookup server 12 for further processing. In response, 
the lookup server 12 provides the information related to the media sample, if available, to 
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5 the client media player. This content-related information is received by the user interface 
38 which then displays the received information to the user of the client media player 14. 

The server engine 16 is a softwai-e application executing withm the lookup server 
12. Server engine comprises a lookup unit 42 and a log unit 44. Lookup unit 42 is 
operatively coupled for conununication with a media database 46. Log unit 44 is 

10 operatively coupled for communication with a log database 48. 

The lookup unit 42 cames out the operation of receiving media samples from the 
client media player 14 and companng the media samples to a collection of sampled 
media content (reference samples). The collection of reference samples will typically 
reside in an in-memory structure (designated 47). Upon initialization of the lookup unit 

15 42, the collection of reference samples from the media database 46 is stored in the in- 
memory structure 47. 

The lookup unit 42 sequentially compares each reference sample in structure 47 
to the media sample provided by the media player 14. The lookup unit will cut the media 
sample into a series of overlapping frames, each frame the exact size of the particular 

20 reference sample under examination. The lookup unit 42 will then compare the reference 
sample to every possible "frame" of information within the media sample and compute 
a "distance" between each frame and the reference sample. This process repeats for 
each reference sample. The reference sample which has the smallest distance to any 
frame in the sample is considered for a match. If this distance is below a predefined 

25 threshold, a match is considered to be found. 

When a match is determined, the information related to the matching record is 
returned to the client media player for presentation thereon. This content-related 
information may include such information as song title, artist, and album name, for 
example. The content-related information may also include product fulfillment 

30 information, such as how and where to purchase media containing the media sample, 
advertising banners, and/or promotional offers, for example. This content-related 
information is transmitted back to the client media player 14 for further processing 
thereon. 

The log unit 44 caries out the operation of tracking media requests made by the 
35 users of the client media player 14 to the lookup server 12. The log unit 44 maintains a 
plurality of information related to the requests made by the user including, for example, 
media type, genre or category. This user information is maintained in the log database 
48 

Media database 46 and log database 48 comprise conventional relational 
40 database storage facilities. Thus, media database 46 may comprise one or more tables 
(not shown) for stonng data associated with the media content samples and related 
content-specific information (song name, title, album name, etc.). In general, another 
computer (not shown) may be connected to the lookup server 12 for entering the 

10 
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5 content-specific media information into the media database 46. Log database 46 may 
comprise one or more tables (not shown) for storing data associated with users and the 
user's related media content requests. 

Referring now to FIG. 2, there is generally shown a functional block diagram of 
a media content identifying system 50 having a plurality of client nodes in accordance 

10 with the invention. 

System 50 compnses a lookup server 54 operatively coupled for communication 
with a plurality of client media players 52a through 52n via Internet connection 32. 
System 50 operates m substantially the same manner as system 10 described above. 
That is, lookup server 54 operates m the same manner as lookup server 12, and clients 

\5 52a through 52n operate in the same manner as client media player 14. However, in 
system 50, lookup server 12 handles a plurahty of media sample identification requests 
from the client 52a through 52n. 

As depicted in FIG. 2, clients 52a through 52n communicate with lookup server 
54 using the HTTP (hypenext transfer protocol) over IP. More particulariy, the media 

20 sample signal generated by each client 52a through 52n is wrapped using a transport 
protocol before transmitted over HTTP to lookup server 54 for processing. When 
lookup server 54 receives the HTTP transmission from the chent media player, a 
transport protocol converts ("unwraps") the signal for processing therein by the server 
engine. Likewise, before transmitting content-related information retrieved from the 

25 media database to the appropriate client media player, the information is first wrapped 
using the transport protocol. The content-related information may be communicated 
using "XML" tags, or other appropriate programming, scripting, or markup language. 
The appropriate client media player upon receiving the transmission from the lookup 
server 54 unwraps the signal for processing therein by the cHent engine. 

30 The method and operation of the invention will be more fully understood by 

reference to the flow charts of FIG. 3 through FIG. 5. The order of acts as shown in 
FIG. 3 through FIG.5 and described below are only exemplary, and should not be 
considered hmiting. 

Referring now to FIG. 3, as well as FIG, 1 and FIG. 2, there is shown the 

35 processes associated with the chent engine 30 m accordance with the invention. As 
descnbed above, the client engine 30 is a software application or embedded system 
operating within a client media player for identifying media content played via the chent 
media player. 

At process 100, the client engine 30 is initiated. The chent engine 30 may be 
40 initiated by the user of client media player 14 or may initiate automatically upon the 
recognition of media content being played via the sound card 20 and/or the video card 
24. Box 110 is then carried out. 
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5 At box 1 10, the intercept unit 36 is initiated. As described above, the intercept 

unit 36 carries out the operation of monitoring media content played on client media 
player 14 (typically via sound card 20 and/or video card 24). The processes of the 
intercept unit 36 ai*e described more fully below in conjunction with FIG. 4. After the 
intercept unit 36 is initiated, box 120 is then cairied out. 
10 At box 120, a user request for media information is received via user interface 

38. This request is communicated from the user interface 38 to the samphng unit 34 tor 
further processing. Box 130 is then carried out. 

At box 130, the sampling unit 34 creates a media sample from the media data 
contained m the FIFO buffer 40. The media sample created by the sampling unit 34 
15 comprises a "digital fingerprint" using conventional digital signal processing known in 
the art. As noted above, a sound file may be sampled according to its 
acoustic/perceptual features over time. Box 14 is then carried out. 

At box 140, the sampling unit 34 transmits the media sample created from box 
130 to the lookup server 12 for content identification and content-related information. 
20 In the illustrative system depicted in FIG. 1 and FIG. 2, the media sample is first 
wrapped in a transport protocol and then communicated over IP (HTTP) via network 
interface 28. Box 150 is then carried out. 

At box 150, the client media player 14 receives from the lookup serv^er 12 the 
content-related information requested in box 140. The process for generating the 
25 content-related information by the lookup server 12 is described in further detail below 
in conjunction with FIG. 5. This content data is first received via the network interface 
28, unwrapped using the appropriate transport protocol and then communicated to the 
client engine 30. In the client engine 30, the user interface 38 receives the content- 
related information, and parses the data according to the appropriate format (XML tags, 
30 for example) transmitted by the lookup server 12. Box 160 is then carried out. 

At box 160, the user interface 38 presents the content-related information to the 
user via video card 24 and display 26, or other display device such as an LCD screen, or 
standard broadcast television. As described above, this content-related information may 
include such information as song tide, artist, and album name, for example. The content- 
35 related information may also include product fulfillment information, such as how and 
where to purchase media containing the media sample, advertising banners, and/or 
promotional offers, for example. 

Referring now to FIG. 4, there is generally shown the processes associated with 
the media intercept unit 36 in accordance with the invention. In general, the media 
40 intercept unit 36 maintains a FIFO buffer of predetermined size (150 seconds, for 
example) of media content played via client media player 14, 52a through 52n. 
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5 At process 200, the intercept unit 36 is initiated. This is carried out from box 

1 10 of FIG. 3, during the start up of the chent engine software 30. Box 210 is then 
carried out. 

At box 210, the intercept unit 36 monitors the media hardware devices of the 
client media player 14. As described above, the media hardware devices may comprise a 

10 sound card 20 and/or a video card 24. Other hardware devices suitable for playing media 
content are also monitored by the intercept unit 36, Diamond 220 is then earned out. 

At diamond 220, the intercept unit 36 determines whether any media hardware 
devices are playing media content. If the intercept unit 36 determines that media content 
is current being played, diamond 230 is carried out. Otherwise box 210 is repeated. 

15 At diamond 230, the intercept unit 36 determines whether the FIFO buffer 40 is 

currently full. If the buffer is determined to be full, box 240 is carried out. Otherwise, 
box 250 is carried out. 

At box 240, the intercept unit 36 has determined that all the buffer 40 is full and 
deletes the older sample in the buffer. As noted above, the FIFO buffer 40 is commonly 

20 configured with a predetermined size. For example, if the sampling rate is 22 KHz 
(HiloHertz), the olderst 1/22000* of a second sample would be deleted, and the newest 
1/22,000"^ of a second sample would be added to the buffer. Using this method, the 
intercept unit 36 is able to maintain the most recent one hundred fifty (150) seconds of 
media content played via client media player 14. Box 250 is then carried out. 

25 At box 250, the intercept unit 36 stores the media content currently being played 

on client media player 14 into the FIFO buffer 40. Box 210 is then repeated to continue 
monitoring media hardware devices. 

Referring now to FIG. 5, there is generally shown the processes associated with 
the server engine 16 and the lookup server 12 in accordance with the invention. It is 

30 noted that the lookup server 12 is structured and configured to handle a plurality of 
requests from a plurality of client media players as depicted in FIG. 2 above. 

Prior to box 300 as described below, the in-memory structure 47 is populated 
with reference media samples. Upon initialization of the lookup unit 42, the collection 
of reference samples from the media database 46 is stored in the m-memory structure 4 

35 At box 300, the lookup server 12 receives a media sample request from one of 

the client media players 14. 52a through 52n. As described above, such requests are 
generally communicated o\'er an Internet connection 32, although any suitable network 
connection may also be used. The request is first received via network interface 17 via 
the IP (HTTP) protocol. An appropriate transport protocol unwraps the message and 

40 communicates the request to the server engine 16 for processing. Box 310 is then 
carried out. 



13 



wo 01/62004 



PCT/USOl/04997 



5 At box 3 10, the lookup unit 42 sequentially compares each reference sample in 

structure 47 to the media sample provided by the media player 14, 52a through 52n, as 
described above. The lookup unit will cut the media sample into a series of overlapping 
frames, each frame the exact size of the particular reference sample under examination. 
The lookup unit 42 will then compare the reference sample to every possible "frame" 

10 of information within the media sample and compute a "distance" between each frame 
and the reference sample. This process repeats for each reference sample. Diamond 320 
is then earned out. 

At diamond 320, the lookup unit 42 determines whether a match was obtained. 
As noted above, the reference sample which has the smallest distance to any frame in the 
15 sample is considered for a match. If the distance is below a predefined threshold, a 
match is considered to be found. If a match is determined, box 330 is then carried out. 
Otherwise box 350 is carried out. 

At box 330, the media sample from box 300 matches a corresponding record in 
the media database 46, and the lookup unit 42 retrieves the content-related information 
20 associated with the matching record in the media database 46. As noted above, this 
content-related information may include such information as song tide, artist, album 
name, for example as well as other information such as product fulfillment information. 
Box 340 is then carried out. 

At box 340, the content-related information obtained in box 330 is transmitted to 
25 the client media player submitting the media sample request in box 300. In the preferred 
embodiment, this transmission is carried out over the Internet using the IP (HTTP) 
protocol. Box 370 is then carried out to generate a log. 

At box 350, the lookup unit 42 has determined that the media sample from box 
300 does not match any reference sample in the in-memory structure 47. Box 360 is 
30 then carried out. 

At box 360, a notice indicating that the requested information is not available is 
transmitted to the requesting client media player of box 300. Box 370 is then carried 
out. 

At box 370 the log process is initiated. This log process involves tracking the 
35 user and related-media information associated with the request of box 300. The log unit 
44 maintains a plurality of information related to the requests made by the user 
including, for example, media type, genre or category. This user information is 
maintained in the log database 48 and may be used for various marketing strategies, 
including analyzing user behavior, for example. 
40 Accordingly, it will be seen that this invention provides an apparatus and method 

which provides real-time media content-related information to users viewing media 
content over a personal computer, or other data processing means. Although tiie 
description above contains many specificities, these should not be construed as limiting 

14 
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5 the scope of the invention but as merely providing an illustration of the presently 
preferred embodiment of the invention. Thus the scope of this invention should be 
determined by the appended claims and their legal equivalents. 
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CLAIMS 

What is claimed is: 

1. A method for identifying media content presented on a chent media player 
comprising: 

10 a) creating a sample of a segment of the media content presented on the client 

media player; 

b) comparing said sample lo a collection of reference sampled media content to 
obtain content-related data from said collection of reference sampled media 
content; and 

1 5 c) presenting said content-related data on said client media player. 

2. The method of claim 1, wherein said collection of reference sampled media content is 
maintained in a lookup server in communication with said client media player, said 
method further comprising: 

a) transmitting said sample to a lookup server, before comparing said sample to 
20 said collection of reference sampled media content; and 

b) transmitting said content-related data to said client media player, before 
presenting said content-related data. 

3. The method of claim 1 further comprising receiving a user-initiated command to 
identify said media content before creating a sample of a segment of said media content. 

25 4. The method of claim 1 wherein said creating a sample of a segment of said media 
content is carried out periodically. 

5. The method of claim 1 wherein said creating a sample of a segment of said media 
content is carried out continuously. 

6. The method of claim 1 wherein said creating a sample of a segment of said media 
30 content further comprises maintaining a fixed buffer of media presented on said client 

media player. 

7. The method of claim 1 wherein said creating a sample of a segment of said media 
content further comprises creating a digital fingerprint of said media content presented 
on said client media player. 

35 8. The method of claim 1 wherein said transmitting said sample to a lookup server is 
carried out over an Internet connection. 

9. The method of claim 1 wherein said transmitting said content-related data to said 
client media player is earned out over an Internet connection. 

10. The method of claim 1 wherein said media content is audio content. 
40 11. The method of claim 1 wherein said media content is video content. 
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5 12. A program storage device readable by a machine, tangibly embodying a program of 
instructions executable by the machine to perform a method for identifying media 
content presented on a client media player, said method comprising: 

a) creating a sample of a segment of the media content presented on the client 
media player; 

10 b) comparing said sample to a collection of reference sampled media content to 

obtain content-related data from said collection of reference sampled media 
content; and 

c) presenting said content-related data on said client media player. 

13. The program storage device of claim 12, wherein said collection of reference 

15 sampled media content is maintained in a lookup server m communication with said 
client media player, said method further comprising: 

a) transmitting said sample to a lookup server, before comparing said sample to 
said collection of reference sampled media content; and 

b) transmitting said content-related data to said client media player, before 
20 presenting said content-related data. 

14. In a network having at least one client media player operatively coupled for 
communication with a lookup server, a system for identifying media content presented 
on the client media player comprising: 

a) a client engine executing within the client media player, said client engine 
25 structured and configured to sample media content played on said client 

media player into a media sample and communicate said media sample to 
said lookup server; 

b) a collection of reference sampled media content containing a plurality of 
sampled media content, each associated with corresponding content-specific 

30 information; and 

c) a server engine executing with said lookup server connected to said 
collection of sampled media content, said server engme structured and 
configured to receive said media sample from said client media player and 
compare said received media sample with said plurality of sampled media 

35 content in said collection of reference sampled media content to query and 

ascertain content information related to said received media sample, said 
server engine further structured and configured to transmit said content 
information to said client media player. 

15. The system of claim 14 wherein said client engine is further structured and 
40 configured to display said content information. 

16. The system of claim 14 wherein said media content is audio content. 

17. The system of claim 14 wherein said media content is video content. 
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10 



18. In a network having at least one client media player operatively coupled for 
communication with a lookup server, a client engine executing within the client media 
player for identifying media content presented on the client media player comprising: 

a) an interrupt unit structured and configured to monitor the client media player 
for media content presented thereon; 

b) a sampling unit structured and configured to generate a media sample from 
said media content monitored by said inten^upt unit and transmit said media 
sample to said lookup server; and 

c) a user interface structured and configured to receive content infomiation 
related to said media sample from said lookup server and presenting said 

15 content information. 

19. The client engine of claim 18, wherein said user interface is further structured and 
configured to receive user-requests from a user of said client media player, said 
sampling unit further structured and configured to generate said media sample upon 
receipt of said user request. 

20 20. In a network having at least one client media player operatively coupled for 

communication with a lookup server, a server engine executing within the lookup server 
for identifying media content presented on the client media player comprising: 

a) a collection of reference sampled media content containing a plurality of 
sampled media content, each associated with corresponding content-specific 

25 information; and 

b) a lookup unit coupled to said collection of reference sampled media content, 
said lookup unit structured and configured to receive a media sample from 
said client media player and compare said received media sample with said 
plurality of sampled media content in said collection of reference sampled 

30 media content to query and ascertain content information related to said 

received media sample, said server engine further structured and configured 
to transmit said content information to said client media player. 

21. The server engine of claim 20 wherein said received media content is audio content. 

22. The server engine of claim 20 wherein said received media content is video content. 
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